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Abstract. More than a century ago, G. Kowalewski stated that for each n 
continuous functions on a compact interval [a,b], there exists an n-point quad- 
rature rule (with respect to Lebesgue measure on [a, b]), which is exact for given 
functions. Here we generalize this result to continuous functions with an arbi- 
trary positive and finite measure on an arbitrary interval. The proof relies on 
a version of Caratheodory's convex hull theorem for a continuous curve, that 
we also prove in the paper. As applications, we give a representation of the co- 
variance for two continuous functions of a random variable, and a most general 
version of Griiss' inequality. 

1. Introduction and Main Results 

More than 110 years ago, G. Kowalewski published the following result, in a 
paper entitled (in translation from German language) "A mean value theorem for 
a system of n integrals" . 

Theorem l.[4] Let x%, . . . ,x n be continuous functions in a variable t G [a, b]. 
There exist real numbers t\, . . . , t n in [a, b] and non-negative numbers Ai, . . . , A n , 
with Y^i=i A; = — a, such that 

i-b 

X]~(t) dt = \iXk(ti) + • • • + \ n X] i {t n ) 1 for each k = 1,2, . . .n . 



In [5], Kowalewski generalized Theorem 1, with dt replaced with F(t) dt, where 

F is continuous and of the same sign in (a, b), and with J27=i ^ = ll ^X*) dt. ^ 
seems that these results have not found their proper place in the literature; they 
were simply forgotten. Except citations to Kowalewski's Theorem 1 in [2] and 
[9], related to Griiss' and Chebychev's inequalities, we were not able to trace any 
other attempt to use, or to generalize these results. In fact, Theorem 1 and its 
generalization, presented in [HI5], were honestly proved there only for n = 2, using 
a theorem (attributed to K. Weierstrass) from Hermite's course in Analysis pQ. 
Nevertheless, there is an appealing beauty, and a potential for applications in those 
statements. 

In this paper, we offer a generalization of Theorem 1, for arbitrary interval / (not 
necessarily finite), with respect to any positive finite measure, and with functions 
Xi that are continuous, but (if I is open or infinite) not necessarily bounded. 

Our main result is the following theorem. 

Theorem 2. For an interval / C I, let /i be a finite positive measure on the 
Borel sigma-field of I. Let Xk, k = 1, . . . , n, n > 1, be continuous functions on L, 
integrable on J with respect to the measure /i. Then there exist points t%, . . . ,t n in 
I, and non-negative numbers Ai, . . . , A n , with X)"=i ^» = m(-0> such that 
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/ x k (t) d(j,(t) = ^ Kx k (ti), k = l,...,n. 
^ i=i 

In Section 2, we prove Theorem 2 via the following version of Caratheodory's 
convex hull theorem, which also can be of an independent interest. We show that 
each point in the convex hull of a continuous curve in R™ is a convex combination 
of n points of the curve, rather than of n + 1 points, which would follow from the 
classical Caratheodory's theorem. 

Theorem 3. Let C : t <— » x(t), t 6 I , be a continuous curve in R™, where JcM 
is an interval, and let K be the convex hull of the curve C . Then each v £ K can 
be represented as a convex combination of n or fewer points of the curve C. 

In Section 3, we discuss Theorem 2 in the context of quadrature rules, and in Sec- 
tion 4 we apply Theorem 2 to derive a representation of the covariance for functions 
of a random variable, and to obtain a most general form of Gruss' inequality. 

2. Proofs of Theorems 2 and 3 

Proof of Theorem 3. According to Caratheodory's theorem, any point v G K 
can be represented as a convex combination of at most n + 1 points of the curve 
C. Therefore, there exist real numbers tj 6 / and vj > 0, < j < n, such that 
t < ti < ■ ■ ■ < t n , Vq + ■■■ + v n = 1, and 

(1) v = v x(t ) + vxx(ti) H \-v n x(t n ). 

In the sequel, we assume that all n + 1 points x(tj) do not belong to one hyperplane, 
and that the numbers Vj are all positive; otherwise, one term from ([1]) can be 
obviously eliminated. Denote by Pj(x), < j < n, the coordinates of the vector x £ 
R™ with respect to the coordinate system with the origin at v, and with the vector 
base consisted of vectors x(tj) —v,j = l,...,n (that is, x = S?=i Pj( x )( x (tj) ~ v ))- 
Since 

vo (x(t ) - v ) = -vi (x(ti) -v) v n (x(t n ) - v) , 

we have that pj (x(to) — v) = —Vj/vo < 0, j = 1, . . . , n, i.e. the coordinates of the 
vector x(to) — v are negative. The coordinates of vectors x(tj) — v, j = 1, 2, . . . , n 
are non-negative: pj (x(tj) — v) = 1 and p k (x(tj) — v) = for k ^ j. Since the 
functions t t— > pj (x(t) — v) are continuous, the set of points t > to at which at 
least one of these functions reaches zero is closed, and since it is nonempty, it 
has the minimum. Denoting that minimum by t, we conclude that the numbers 
Pj (x(t) — v), j — 1, . . . , n, are non-positive and at least one of them is zero. Let 
Pk (x(t) — v) = and pj (x(t) — v) < for j ^ k. From 

k— 1 n 

x{t)~v = ^pj (x(t) - v) (x(tj) - v) + ^2 pj (x(t) - v) (x(tj) - v), 

3=1 j=k+l 

it follows that 

fe-i 

l-£pj (»(*)-«) 



Pj (x(t) - v) J V = 

j=k+l 



3 



k—1 n 

x(t)-^2pj(x(t)-v)x[tj)- Pj(x(i)-v)x{tj), 

3=1 ]=k+l 

wherefrom it follows that v is a convex combination of points x(i) and x(tj), j = 
l,...,n, j ^ k. 

Proof of Theorem 2. For given continuous and /x-integrable functions Xk, 
k = l,...,n, defined on an interval /, let C be the curve in W 1 parametrized 
with x\ = x\(t), . . . , x n — x n {t), t £ I. Without loss of generality, we prove the 
statement of Theorem 2 under the following two assumptions. 
Al The measure /i is probability measure, i.e, = 1; 

A2 The functions X\, . . . ,x n are not ^t-a.e affine dependent. That is, for any 
hyperplane ir : ceiXi + • • • + a n x n = 0, with Y^,i=i °H > 0> Ml'' ^ I I x (t) $ 
tt} > 0. 

If the assumption A2 does not hold for the given set of functions Xi, . . . ,x n , it 
suffices to prove the theorem for the maximal subset, say x±, . . . , x ni , n\ < n, of 
functions that satisfy A2, and then it follows automatically for the remaining ones, 
which can be expressed [i— a.e. as affine combinations of X\, . . . , x ni . In particular 
case n = 1, the condition A2 implies that the function x\ is not a constant /i-a.c; 
if it is, the theorem is trivially true. 

Let 



(2) 



J := (Ji, . . . , J„), where J k = J x k (s) d/i(s), k = 1, 



Let K be the convex hull of C. The point J £ M. n , defined in belongs to 
the closure K. To prove that, let us first suppose that / is a finite closed interval 
[a, b]. For a positive integer to, let s| , i = 0,...,to be points in the interval 
[a, 6], such that a = sq < s^'™' < ■■• < sfc 1 ' = b, and let /i-™ 1 ' = /x([s- m ' 1 , sj+i)), 
i = 0, . . . ,ra — 2 and /i^!:! = M([ s tn^i! s ro ])■ Let us define 

m — 1 

(3) jH := (j{'">...,Jn where jf> = £ z^V^ fc = 1, ■ ■ ■ , n. 

i=0 

By continuity of functions x k on [a, 6], integrals J k are limits as m — > +oo of 

fc 



integral sums J k m \ i.e., 



J= lim j( m \ 

m — >+oo 

with points sf^ chosen, for example, equidistantly. Since J^S) 1 /-*! = L eacn 
j( m ) belongs to K, hence, J £ K. 

For an open or infinite interval /, there exists a sequence of closed intervals 
I p , p = 1,2,..., such that l\ C 1% C • • • and U p I p = I. Since functions x k are 
integrable on /, we have that 

(4) lim / Xk (t)^- = J k , fc=l,...,n. 

As the n-dimensional vector of integrals under the limit in (HJ) belongs to K, so 
does the vector J = (Ji, . . . , J n ). This proves that J £ K for arbitrary interval /. 

o 

Now we will show that, in fact, J £K- Indeed, if J were in the boundary of the 
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convex set K, then it would have existed a hyperplane 7r containing J, such that 
the points of K , and in particular, all points of the curve C, lie in one side of n. 
More precisely, there would have existed real numbers ax, . . . , a n , at least one of 
them being non-zero, such that 

(5) ax(xx(t) — J\) H \-a n {x n (t) - J n ) > 0, for all* e J. 

By the assumption A2, the strict inequality in ([5]) should hold on a subset I' C I 
with [i(I') > 0; hence, the integral over / of the left hand side in (fSJ) would have 
been strictly positive, but it is zero. Therefore, J does not lie on the boundary of 
K. 

o 

Now, using Theorem 3, and the fact that J GKC K, we conclude that J can be 
expressed as a convex combination of not more than n points of the curve C, which 
ends the proof. 

3. Theorem 2 from a viewpoint of quadrature rules 

Theorem 2 claims that, given any set of continuous functions on /, and a finite 
measure fi on /, there exists a (at most 7i-point) quadrature rule which is exact 
for those functions. As it can be seen by inspection of the proofs in Section 2, 
this quadrature rule is not unique; a point in the interior of a convex hull can be 
expressed as a convex combination in infinitely many ways. This interpretation of 
Theorem 2 can be compared with a well known result from [5] , regarding Gaussian 
quadratures with respect to Chebyshev systems of functions. A brief explanation 
of these terms is in order. 

Real functions Xi,... ,x m defined on an interval [a, 6] are said (see [3]) to con- 
stitute a Chebyshev system on [a, b] if all functions are continuous on [a, b] and 



(6) 



xi{h) 



x 1 (t 2 ) . 

X2 (*2) • 



•El {tm) 
X2(tm) 



for any choice of points t±, . . . , t m G [a, b] with ti tj whenever i ^ j. A classical 
example of a Chebyshev system on any interval [a, b] is furnished with functions 
Xi(t) — t 1 ^ 1 , i — 1, . . . ,m. The condition ^ is equivalent to the requirement that 
no to points of the curve parametrized with x± = x±(t), . . . , x m = x m (t), t £ [a, b] 
belong to a hyperplane which contains the origin. Another way to express (j6]) is 
to require that any function of the form g(t) — c\Xi(t) + • ■ ■ + c m x m (t), a G K, 
Y^lLi c i > 0? must not have more than n — 1 different zeros on [a, b], 

According to [5] , for given positive and finite measure fj, on [a, b] , a quadrature 
rule of the form 



(7) 



f(s)d»(s)=J2 A kf(tk)+Rn(f), 4 6 1, t k G[a,b] 
^< h ] fe=l 



, X2n if © is exact 
2n. A quadrature 



is called Gaussian with respect to a collection of functions x\ 
for all functions Xi in place of /, i.e. R n (xi) — for i = 1, 
rule of the form |0 is determined by a choice of coefficients Aj~ and points tk, 
k = 1, . . . , n. 
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The next theorem, which can be derived from 3, Chapter 2], claims the existence 
and uniqueness of a Gaussian quadrature rule with respect to a Chebyshev system 
of continuous functions x\, . . . , x 2n on [a, b]. 

Theorem 4. 3] There exists a unique n-point Gaussian quadrature rule Ffy with 
respect to any Chebyshev system of continuous functions X±,...,X2n on a finite 
interval [a, b]. Moreover, all coefficients A±, . . . , A n are positive. 

There are variations and generalizations of Theorem 4 in various directions, see, 
for example, [6] or recent paper [7]. 

Clearly, Theorem 4 yields a particular case of Theorem 2 if functions xi, ■ ■ ■ , x n 
can be complemented with suitably chosen functions (for example 1 , £, t 2 , . . . ,t n ) 
to make a Chebyshev system of 2n functions on interval [a, b]. However, Theorem 
2 is much more general, it is not limited to compact intervals, it allows unbounded 
functions, and does not require the condition ©, which is very restrictive and 
difficult to check. The price payed for the generality is the fact that an n-point 
quadrature rule claimed in Theorem 2 is exact for n functions instead of 2n, as in 
Theorem 4. 

4. A REPRESENTATION OF COVARIANCE AND GENERALIZED GRUSS' INEQUALITY 

As an application of Theorem 2, we give a representation of the covariance of 
random variables f(X) and g(X), where / and g are continuous functions on an 
interval Jcl, and X is a random variable concentrated on I. The idea goes back 
to Karamata [2] , who used Theorem 1 to prove a statement of our next theorem in 
particular case of uniform distribution of X on a compact interval. 

As usual, the expectation operator E is defined as 

Ef(X) = / f(t)dn x (t), 
Jr. 

where / is a measurable function, and jix is a probability measure induced by X 
on the Borel sigma field of ML We say that B C K is a support of X, or that X 
is concentrated on B, if pix(B) = 1. If X is concentrated on B, then the domain 
of integration (and the domain of /) can be taken to be B. The covariance for 
random variables U and V is defined as 

Cov(U,V) = E(U -EZ7)(V -EV) = E (UV) — E Z7E V. 

Theorem 5. Let X be a real valued random variable concentrated on an 
interval ICR. Suppose that f and g are continuous functions on I , such that 
f(X) and g{X) have finite second order moments. Then there exist tx,t2 £ I, such 
that 

(8) Cov {f{X), g{X)) = ~ (/(<!) - f(t 2 )) (9(h) - g{t 2 )) . 

Proof. We will use Theorem 2 with n = 2, with functions x\(t) = (f(t) — 
E f(X))(g(t) —~Eg(X)) and Xzif) — fit), and with the probability measure /i = nx 
induced by the random variable X. Using simplified notations F = Ef(X) and 
G = Eg(X), we find that 



Cov(f(X),g(X)) = X(f(t 1 )-F)(g(t 1 )-G) + (l-X)(f(t 2 )-F)(g(t 2 )-G) 
F = A/(ti) + (1 - X)f(h), 
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for some A > and ti^tz G [a,b\. Replacing F in the first equality with the right 
hand side of the second one, we conclude that 

(9) E f(X)g(X) - FG = A(l - A)(/(t 1 ) - f(h))(g(t x ) - g(t 2 )), 

for some A £ [0,1] and some t\,t2 £ /. By continuity of / and g, we can find 
another two numbers, call them again ti,t 2 , so that holds true with A = 1/2, 
that is, with the maximal possible value A(l — A) = 1/4. □ 

A well known Griiss' inequality (see [H] for a survey) can be stated in terms of 
an inequality for the covariance for f(X) and g(X), where X is a random variable. 
In its original formulation, it claims that 

(10) \Coy(f(X),g(X))\<^(M f -m f )(M g -m g ), 

where X has a uniform distribution on a compact interval [a, b], f and g are con- 
tinuous functions on [a, 6], with m/ < fit) < Mf and m g < g(t) < M g for t G [a, b]. 
There has been a lot of related research, and many different versions of (fTO)) are 
known. The following generalization of Griiss' inequality is an immediate corollary 
to Theorem 5. 

Theorem 6. Let X be a real valued random variable concentrated on an 
interval ICR. Suppose that f and g are continuous functions on I , such that 
m f < f(t) <M f and m g < g(t) < M g for t G I. Then 

(11) |Cov {f{X),g{X))\ < ~ (M f - m f )(M g - m g )) . 



In particular, Theorem 6 yields the following discrete version of Griiss' inequality. 
Let p ni n = 1,2,... be non- negative weights with Xm=a P n = ^ ana * ^ i u n} an< i 
{v n } be bounded sequences of real numbers, with 



u < u n <U, v < v n < V, 



Then 
(12) 



+ 00 



n—1 n—1 n—1 



for each n = 1, 2, . 



< -(U-u)(V-v). 



To prove this inequality, it suffices to observe that there exist functions / and g, 
continuous on R + , such that f(n) = u n and g(n) = v n , and such that u < f(t) < U 
and v < g(t) < V for t > 0. Then (|ll|l . applied with the random variable X which 
takes values n = 1,2,... with probabilities p n yields (|12p . 
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